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Description 

Reld of the Invention 

5 This invention relates to expression vectors containing a DNA sequence from the human 
cytomegalovirus major immediate early gene, to host cells containing such vectors, to a method of 
producing a desired polypeptide by using vectors containing said sequence and to the use of said DNA 
sequence. 

70 Background to the Invention 

The main aim of workers In the field of recombinant DNA technology is to achieve as high a level of 
production as possible of a particular polypeptide. This is particularly true of commercial organisations who 
wish to exploit the use of recombinant DNA technology to produce polypeptides which naturally are not 
75 very abundant. 

Generally the application of DNA technology involves the cloning of a gene encoding the desired 
polypeptide, placing the cloned gene in a suitable expression vector, transfecting a host cell line with the 
vector, and culturing the transfected cell line to produce the polypeptide. It Is almost impossible to predict 
whether any particular vector or cell line or combination thereof will lead to a useful level of production. 
20 In general, the factors which significantly affect the amount of polypeptide produced by a transfected 
cell line are: 1. gene copy number, 2. efficiency with which the gene is transcribed and the mRNA 
translated, 3. the stability of the mRNA and 4. the efficiency of secretion of the protein. 

The majority of work directed at increasing expression levels of recombinant polypeptides has focussed 
on improving transcription initiation mechanisms. As a result the factors affecting efficient translation are 
25 much less well understood and defined, and generally it is not possible to predict whether any particular 
DNA sequences will be of use in obtaining efficient translation. 

Attempts at investigating translation have consisted largely of varying the DNA sequence around the 
consensus translation start signal to determine what effect this has on translation Initiation (Kozak M. Cell 41 
283-292 (1986)). 

30 Studies involving expression of desired heterologous genes normally use both the coding sequence and 
at least part of the s'-untranslated sequence of the heterologous gene such that translation initiation is from 
the natural sequence of the gene. This approach has been found to be unreliable probably as a result of the 
'hybrid nature* of the s'-untranslated region and the fact that the presence of particular 5-untranslated 
sequences can lead to poor initiation of translation (Kozak M. Prod. Natl. Acad. Sci. 83 2850-2854 (1986) 

35 and Pelletier and Sonenberg Cell 40 515-526 (1985)). This variation in translation has a detrimental effect on 
the amount of the product produced. 

Previous studies (Boshart et al Cell 411 521-530 (1985) and Pasleau et al, Gene 38 227-232 (1985): 
Stenberg et al, J. Virol ^ (1) 190-199 (1984); Thomson et al Proc. Natl. Acad. Scl. USA 81 659-663 (1984) 
and FoeckTng'and Hofstetter Gene 45 101-105 (1986)) have used sequences from the upstream region of 

40 the hCMV-MIE gene in expression vectors. These have, however, solely been concerned with the use of the 
sequences as promoters and/or enhancers. Spaete and Mocarski (J. Virol 56 (1) 135-143, 1985) have used 
a PstI to PstI fragment of the hCMV-MIE gene encompassing the promoter, enhancer and part of the 5'- 
untranslated region, as a promoter for expression of heterologous genes. In order to obtain translation the 
natural 5'-untranslated region of the heterologous gene was used. 

45 In published European Patent Application No. 260148. a method for the continuous production of a 
heterologous protein is described. The expression vectors constructed contain part of the 5'-untranslated 
region of the hCMV-MIE gene as a stabilising sequence. The stabilising sequence is placed in the S'- 
untranslated region of the gene encoding the desired heterologous protein i.e. the teaching is again that the 
natural 5'-untranslated region of the gene is essential for translation. 

50 

Summary of the Invention 

In a first aspect the invention provides a vector containing a DNA sequence comprising the promoter, 
enhancer and functionally complete 5'-untranslated region including the first intron of the human 
55 cytomegalovirus major immediate early gene, wherein the 5*-untranslated region is not linked directly to the 
DNA coding sequence of the natural human cytomegalovirus major immediate early gene. 

In a preferred embodiment of the first aspect of the invention, the vector includes a restriction site for 
insertion of a heterologous gene. 
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The present invention is based on the discovery that vectors containing a DNA sequence comprising 
the promoter, enhancer and complete 5*-untranslated region of the major Immediate early gene of the 
human cytomegalovirus (hCMV-MIE) upstream of a heterologous gene result in high level expression of the 
heterologous gene product. In particular, we have unexpectedly found that when the hCMV-MIE derived 

5 DNA is linked directly to the coding sequence of the heterologous gene high levels of mRNA translation are 
achieved. This efficient translation of mRNA is achieved consistently and appears to be independent of the 
particular heterologous gene being expressed. 

In a second aspect the invention provides a vector containing a DNA sequence comprising the 
promoter, enhancer and functionally complete 5'-untranslated region including the first intron of the human 

10 cytomegalovirus major immediate early gene, wherein the 5'-untranslated region is linked directly to the 
DNA coding sequence of a heterologous gene. 

The hCMV-MIE derived DNA according to the second aspect of the Invention may be separated from 
the coding sequence of the heterologous gene by Intervening DNA such as for example by the 5*- 
untranslated region of the heterologous gene. 

75 Preferably the hCMV-MIE derived sequence includes a sequence identical to the natural hCMV-MIE 
translation initiation signal. It may however be necessary or convenient to modify the natural translation 
initiation signal to facilitate linking the coding sequence of the desired polypeptide to the hCMV-MIE 
sequence, i.e. by introducing a convenient restriction enzyme recognition site. For example the translation 
Initiation site may advantageously be modified to provide an Ncol recognition site. 

20 The heterologous gene may be a gene coding for any eukaryotic polypeptide such as for example a 
mammalian polypeptide such as an enzyme, e.g. chymosin or gastric lipase; an enzyme inhibitor, e.g. 
tissue inhibitor of metalloproteinase (TiMP); a hormone, e.g. growth hormone; a lymphoklne, e.g. an 
interferon; a plasminogen activator, e.g. tissue plasminogen activator (tPA) or prouroklnase; or a natural, 
modified or chimeric immunoglobulin or a fragment thereof including chimeric immunoglobulins having dual 

25 activity such as antibody-enzyme or antibody-toxin chimeras. 

According to a third aspect of the invention there is provided host cells transfected with vectors 
according to the first or second aspect of the Invention. 

The host cell may be any eukaryotic cell such as for example plant, or insect cells but Is preferably a 
mammalian cell such as for example CHO cells or cells of myeloid origin e.g. myeloma or hybridoma cells. 

30 In a fourth aspect the Invention provides a process for the production of a heterologous polypeptide by 
culturing a transfected cell according to the third aspect of the invention. 

In a fifth aspect the invention provides the use of a DNA sequence comprising the promoter, enhancer 
and subsequentially complete s'-untranslated region Including the first intron of the hCMV-MlE gene for 
expression a heterologous gene. 

35 In a preferred embodiment of the fifth aspect of the Invention the hCMV-MlE derived DNA sequence is 
linked directly to the DNA coding sequence of the heterologous gene. 

Also included within the scope of the invention are plasmids pCMGS, pHT.l and pEE6hCMV. 
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Brief Description of the Drawings 



The present invention Is now described, by way of example only, with reference to the accompanying 
drawings in which 

Rgure 1 : shows a diagrammatic representation of plasmid pSVLGS.1 
Rgure 2: shows a diagrammatic respresentation of plasmid pHT.1 
45 Figure 3: shows a diagrammatic representation of plasmid pCMGS 

Rgure 4: shows the complete sequence of the promoter-enhancer hCMV-MIE Including both the first 

intron and a modified translation 'start' site 
Rgure 5: shows a diagrammatic representation of plasmid pEE6.hCMV 



50 Detailed Description of the Embodiments 



Example 1 

The Pst-lm fragment of hCMV (Boshart et al Cell 41^ 521-530 (1985) Spaete & Mocarski J. Virol 56 (1) 
55 135-143 (1985)) contains the promoter-enhancer and most of the 5'-untranslated leader of the MIE gene 
including the first intron. The remainder of the 5' untranslated sequence can be recreated by attaching a 
small additional sequence of approximately 20 base pairs. 
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Many eukaryotic genes contain an Ncol restriction site (5'-CCATGG-3') overlapping the translation start 
site, since this sequence frequently forms part of a preferred translation initiation signal s'ACCATGPu-S'. 
The hCMV-MIE gene does not have an Ncol site at the beginning of the protein coding sequence but a 
single base-pair alteration causes the sequence both to resemble more closely the "Kozak" concensus 
5 initiation signal and introduces an Ncol recognition site. Therefore a pair of complementary oligonucleotides 
were synthesised of the sequence: 

GTCACCGTCCTTGACAC 
ACGTCAGTGGCAGGAACTGTGGTAC 

which when fused to the Pst-lm fragment of hCMV will recreate the complete s'-untranslated sequence of 
the MIE gene with the single alteration of a G to a C at position -1 relative to the translation initiation codon. 

75 This synthetic DNA fragment was introduced between the hCMV Pst-lm promoter-enhancer leader 
fragment and a glutamine synthetase (GS) coding sequence by ligation of the Pst-lm fragment and the 
synthetic oligomer with Ncol digested pSV2.GS to generate a new plasmid, pCMGS (The production of 
pSV2.GS is described in published International Patent Application No. WO 8704462). pCMGS is shown in 
Figure 3. pCMGS thus contains a hybrid transcription unit consisting of the following: the synthetic oligomer 

20 described above upstream of the hCMV-MIE promoter-enhancer (where it serves merely as a convenient 
Pst1 - Ncol "adaptor"), the hCMV-MIE promoter and the complete 5' untranslated region of the MIE gene, 
including the first intron, fused directly to the GS coding sequence at the translation initiation site. 
pCMGS was introduced into CHO-KI celts by calcium phosphate mediated transfectlon and the plasmid was 
tested for the ability to confer resistance to the GS-inhibitor methionine sulphoximine (MSX). The results of 

25 a comparison with pSV2.GS are shown in Table 1 . 

it is clear that pCMGS can confer resistance to 20 M MSX at a similar frequency to pSV2.GS, 
demonstrating that active GS enzyme is indeed expressed in this vector. 

Table 1 

30 



Results of transfectlon of GS-expresslon vectors Into CHO-Kl cells 


Vector 


no. colonies/10^ cells resistant to 20aM MSX 


PSV2.GS 


32 


pCMGS 


17 


Control 


0 



40 Example 2 

The TIMP cDNA and SV40 polyadenylation signal as used in pTIMP 1 Docherty et al (1985) Nature 
318, 66-69, was inserted into pEE6 between the unique Hindlll and BamHI sites to create~p'EE6TIMP. pEE6 
isa bacterial vector from which sequences inhibitory to replication in mammalian cells have been removed. 

45 It contains the XmnI to Bell portion of pCT54 (Emtage et al 1983 Proc. Natl. Acad. Sci. USA 80, 3671-3675) 
with a pSP64 (Melton eTal 1984: Nucleic Acids. Res.'l2r 7035) polylinker inserted in betwein the Hind lll 
and EcoRI sites. The BamHI and Sail sites have beenlemoved from the polylinker by digestion, filling in 
with Kienow enzyme and religationTThe Bell to Bam HI fragment Is a 237 bp SV40 early polyadenylation 
signal (SV40 2770 to 2533). The Bam HI toliie Bgll fragment is derived from pBR328 (375 to 2422) with an 

50 additional deletion between the Sal! and the AvaTsites (651 to 1425) following the addition of a Sail linker to 
the Aval site. The sequence frornThe Bgll to tFmXmnl site originates from the ^-lactamase geneof pSP64. 

The 2129 base-pair Ncol fragmenTcontaining the hCMV MIE promoter-enhancer and 5' untranslated 
sequence was Isolated from pCMGS by partial Ncol digestion and inserted at the Ncol site overlapping the 
translation initiation signal of TIMP in pEE6.TIMP to generate the plasmid pHT.1 (shown In Figure 2). 

55 A GS gene was introduced into pHT.1 to allow selection of permanent cell lines by introducing the 5.5K 
Pvul - BamHI fragment of pSVLGS.1 (figure 1) at the BamHI site of pHT.1 after addition of a synthetic 
BamHI linker to Pvul digested pSVLGS.1 to form pHT.IGS. In this plasmid the hCMV-TIMP and GS 
transcription units transcribe in the same orientation. 
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pHT.1 GS was introduced into CHO-KI cells by calcium-phosphate mediated transfection and clones 
resistant to 20uM MSX were isolated 2-3 weeks post-transfection. TIMP secretion rates were determined by 
testing culture supernatants in a specific two site ELISA. based on a sheep anti TIMP polyclonal antibody as 
a capture antibody, a mouse TIMP monoclonal as the recognit'on antibody, binding of the monoclonal being 

5 revealed using a sheep anti mouse IgG peroxidase conjugate. Purified natural TIMP was used as a standard 
for calibration of the assay and all curves were linear in the range of 2 - 20ng m\~\ No non-specific reaction 
was detectable in CHO-cell conditioned culture media. 

One cell line GS.19 was subsequently recloned, and a sub-clone GS 19-12 secretes TIMP at a very 
high level of 3 x 10^ molecules/cell/day. Total genomic DNA extracted from this cell line was hybridised 

10 with a TIMP probe by Southern blot analysis using standard techniques and shown to contain a single intact 
copy of the TIMP transcription unit per cell (as well as two re-anranged plasmid bands). This cell line was 
selected for resistance to higher levels of MSX and in the first selection a pool of cells resistant to SOOuM 
MSX was isolated and recloned . The clone GS-1 9.6(500)1 4 secretes 3 x 10* molecules TIMP/cell/day. The 
vector copy-number in this cell line is approx. 20 - 30 copies/cell. Subsequent rounds of selection for further 

75 gene amplification did not led to increased TIMP secretion. 

Thus it appears that the hCMV-TIMP transcription unit from plHT.1 can be very efficiently expressed in 
CHO-KI cells at approximately a single copy per cell and a single round of gene amplification leads to 
secretion rates which are maximal using current methods. 

20 Example 3 

In order to test whether the hCMV-MIE promoter-enhancer-leader can be used to direct the efficient 
expression of other protein sequences, two different but related plasminogen activator coding sequences 
(designated PA-1 and PA-2) were introduced into CHO-KI cells in vectors in which the protein coding 

25 sequences were fused directly to the hCMV sequence. 

In both these cases, there is no Ncol site at the beginning of the translated sequence and so synthetic 
oligonucleotides were used to recreate the authentic coding sequence from suitable restriction sites within 
the translated region. The sequence of the modified hCMV translation-initiation signal as used in pHT.1 was 
also built into the synthetic oligonucleotide which then ended in a Pst-1 restriction site. The Pst-lm 

30 fragment of hCMV was then inserted at this site to create the complete promoter-enhancer-leader 
sequence. 

The hCMV-plasminogen activator transcription units were introduced into CHO-KI ceils after inserting a 
GS gene at the unique Bam HI site as above and MSX resistant cell lines secreting plasminogen activator 
were isolated. 

35 The secretion rates of the best initial transfectant cell lines in each case are given in Table 2. From this 
it is clear that the hCMV promoter-enhancer leader can also be used to direct the efficient expression of 
these two plasminogen activator proteins. 

Table 2 

40 



Secretion rates of the different plasminogen activator proteins from transfectant OHO cell lines. 


Plasminogen activator 


Molecules secreted/cell/day 


PA-1 
PA-2 


5.5 X 10^ 
1.1 X 108 



Example 4 

50 

pEE6hCMV was made by ligating the Pst-lm fragment of hCMV. Hindlll -digested pEE6 and the 
complementary oligonucleotides of the sequence: 

55 GTCACCGTCCTTGACACGA 

ACGTCAGtGGCAGGAACTGTGCTTCGA 
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cDNA encoding an immunoglobulin light-chain was inserted at the EcoRI site of pEE6.hCMV such that the 
hCMV-MIE promoter-enhancer leader could direct expression of the cDNA and a selectable marker gene 
containing the SV40 origin of replication was inserted at the Bam HI site of each ptasmid. 

This plasmid was transfected into COS-1 monl<ey kidney cells by a standard DEAE-dextran transfection 
s procedure and transient expression was monitored 72 hours post transfection. Light chain was secreted into 
the medium at at least lOOng/ml indicating that light chain can Indeed be expressed from a transcription 
unit containing the entire hCMV-MIE 5'-untranslated sequence up to but not including the translation 
initiation ATG, followed by 15 bases of natural s'-untranslated sequence of the mouse immunoglobulin light- 
chain gene. 

10 

Claims 

1. A vector for use in eukaryotic cell lines containing a DNA sequence comprising the promoter, enhancer 
and functionally complete 5'-untranslated region including the first intron of the human cytomegalovirus 

75 major immediate-early gene, wherein the 5'-untranslated region is not linked directly to the DNA coding 
sequence of the natural human cytomegalovirus major immediate-early gene. 

2. A vector for use in eukaryotic cell lines containing a DNA sequence comprising the promoter, enhancer 
and functionally complete 5'-untranslated region including the first intron of the human cytomegalovirus 

20 major immediate-early gene, wherein the 5'-untranslated region is linked directly to the DNA coding 
sequence of a heterologous gene. 

3. A vector according to claim 2 wherein the vector Includes a restriction site to allow the excision and/or 
insertion of the coding sequence of a heterologous gene. 

25 

4. A vector according to claim 1 or claim 2 wherein the human cytomegalovirus major Immediate-early 
DNA includes a translation initiation signal. 

5. A host cell transfected with a vector according to any of the preceding claims. 

30 

6. A process for the production of a heterologous polypeptide comprising culturing a host cell according 
to claim 4. 

7. The use of a DNA sequence comprising the promoter, enhancer and functionally complete 5*- 
35 untranslated region including the first Intron of the human cytomegalovirus major immediate-early gene 

for expression of a heterologous gene in a eukaryotic cell line wherein the human cytomegalovirus 
major immediate-early gene derived DNA is linked directly to the coding sequence of the heterologous 
gene. 

40 8. A plasmid for use in a eukaryotic cell line comprising the human cytomegalovirus major immediate- 
early promoter and the functionally complete 5'-untranslated region of the major immediate-early gene. 
Including the first intron, fused directly to a glutamine synthetase coding sequence at the translation 
initiation site. 

45 9. A plasmid comprising the human cytomegalovirus major immediate-early promoter and the functionally 
complete 5*-untranslated region of the major immediate-early gene, including the first intron, fused 
directly to a tissue inhibitor of metalloproteinase (TIMP) coding sequence at the translation initiation 
site. 

50 10. A plasmid comprising the human cytomegalovirus immediate-early promoter and the functionally 
complete 5'-untranslated region of the major immediate-early gene, including the first intron. wherein 
the 5'-untransiated region comprises a translation start site which Is coincident with an NC01 restriction 
site. 

55 PatentansprUche 

1. Vektor zur Verwendung In eukaryotischen Zell-Llnien, der eine DNA-Sequenz. umfassend den Promo- 
tor, den Verstarker und die funktlonell vollstSndige 5'-untranslatierte Region, Inklusive des ersten 
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Introns, des grdfieren unmittelbar-frOhen (major immediate-early) Gens des humanen Cytomegalovirus 
enthMIt, wobei die 5*-untranslatierte Region nicht direkt an die DNA-Codierungssequenz des natUrlichen 
grSBeren unmittelbar-frQhen Gens des humanen Cytomegalovirus gebunden ist. 

5 2. Vektor zur Venwendung in eukaryotischen Zell-Linien, der eine DNA-Sequenz enthalt, die den Promoter, 
den VerstSrker und die funktionell voilstSndige 5'-untranslatierte Region, inklusive des ersten Introns, 
des grd0eren unmittelbar-frtihen Gens des humanen Cytomegalovirus enthalt, wobei die 5'-untransla- 
tierte Region direkt an die DNA-Codierungssequenz eines heterologen Gens gebunden ist. 

10 3. Vektor nach Anspruch 2, weicher eine Restriktionsstelle enthSIt, um die Excision und/oder Insertion der 
Codierungssequenz eines heterologen Gens zu gestatten. 

4. Vektor nach Anspruch 1 oder Anspruch 2. wobei die grdOere unmittelbar-fruhe DNA des humanen 
Cytomegalovirus ein Translationslnltiationssignal enthMIt 

5. Wirtszelle, die mit einem Vektor nach irgendeinem der vorhergehenden AnsprUche transfiziert ist. 

6. Verfahren zur Herstellung eines heterologen Polypeptids, bei welchem Verfahren eine Wirtszelle nach 
Anspruch 4 gezUchtet wird. 

20 

7. Verwendung einer DNA-Sequenz, die den Promoter, den VerstSrker und die funktionell voUstMndige 5'- 
untranslatierte Region, inklusive des ersten Introns, des gr60eren unmittelbar- frUhen Gens des 
humanen Cytomegalovirus umfaBt. zur Expression eines heterologen Gens in einer eukaryotischen Zell- 
Linie, wobei die von dem groi3eren unmittelbar-fruhen Gen des humanen Cytomegalovirus stammende 

25 DNA direkt an die Codierungssequenz des heterologen Gens gebunden ist. 

& Plasmid zur Verwendung in einer eukaryotischen Zell-Llnie, das den gr66eren unmittelbar-frQhen 
Promoter des humanen Cytomegalovirus und die funktionell vollstSndige 5'-untranslatlerte Region des 
groBeren unmittelbar-fruhen Gens, inklusive des ersten Introns, direkt an eine Glutaminsynthase- 
30 Codierungssequenz an der Translationsinitiationsstelle tusioniert enthMlt. 

9. Plamid, das den groBeren unmittelbar-frUhen Promoter des humanen Cytomegalovirus und die funktio- 
nell vollstSndige 5'-untranslatierte Region des gr^Beren unmittelbar-frUhen Gens, inklusive des ersten 
Introns, direkt an die Codierungssequenz eines Gewebeinhibitors von Metalloprotelnase (TIMP) an der 

35 Translationsinitiationsstelle fusioniert enthSlt. 

10. Plasmid, das den groBeren unmittelbar-frQhen Promoter des humanen Cytomegalovirus und die 
funktionell vollstMndige 5*-untranslatierte Region des gr^Beren unmittelbar-frQhen Gens, inklusive des 
ersten Introns. enthalt, wobei die S'-untranslatierte Region eine Translationsstartstelle enthSIt, die mit 

40 einer NC01 -Restriktionsstelle zusammenfSllt. 

Revendlcatlons 

1. Vecteur pour utilisation dans des lign^es cellulaires eucaryotes, contenant une sequence d'ADN 
45 comprenant le promoteur, I'activateur et la region non traduite en 5', fonctionnellement complete, 
comprenant le premier intron du g^ne pr^coce-imm^diat majeur du cytomegalovirus humain, dans 
lequel la region non traduite en 5' n'est pas li^e directement k la sequence codante de I'ADN du g^ne 
pr^coce-imm^diat majeur du cytomegalovirus humain naturel. 

50 2. Vecteur pour utilisation dans des lign^es cellulaires eucaryotes. contenant une sequence d'ADN 
comprenant le promoteur, i'activateur et la region non traduite en 5', fonctionnellement complete, 
comprenant le premier intron du g^ne pr^coce-immediat majeur du cytomegalovirus humain, dans 
lequel la region non traduite en 5' est li^e directement k la sequence codante de TADN d'un g^ne 
heterologue. 

55 

3. Vecteur seion la revendication 2, dans lequel le veteur comprend un site de restriction pour permettre 
Texcision et/ou insertion de la sequence codante d'un g§ne heterologue. 
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4. Vecteur selon la revendication 1 ou la revendication 2. dans lequel I'ADN pr^coce-lmm^diat majeur du 
cytomegalovirus humain comprend un signal d'inltiation de traduction. 

5. Cellule hote transfect^e avec un vecteur selon Tune quelconque des revendications pr^c^dentes. 

5 

6. Proc^d^ pour la production d*un polypeptide h^t^rologue, comprenant la culture d'une cellule hdte 
selon la revendication 4. 

7. Utilisation d'une sequence d'ADN comprenant le promoteur, Tactivateur et la region non traduite en 5', 
10 fonctionnellement complete, comprenant le premier intron du ghne pr^coce*imm^diat majeur du 

cytomegalovirus humain, pour Texpression d'un ghue h^t^rologue dans une lign^e cellutaire eucaryote, 
dans laquelle TADN provenant du g^ne prdcoceimm^diat majeur du cytomegalovirus humain est lie 
directement ^ la sequence codante du g^ne het^rologue. 

75 8. Plasmide pour utilisation dans une lign^e cellulaire eucaryote. comprenant le promoteur pr^coce- 
imm^diat majeur du cytomegalovirus humain et la region non traduite en 5', fonctionnellement 
complete, du gene precoce-immediat majeur, y compris le premier intron, soudSe directement ^ une 
sequence codant pour la glutamine synthetase, au site d'initiation de traduction. 

20 9. Plasmide comprenant le promoteur precoce-immediat majeur du cytomegalovirus humain et la region 
non traduite en 5', fonctionnellement complete, du g^ne precoce-immediat majeur, y compris le 
premier intron, soudee directement k une sequence codant pour Tinhibiteur tissulaire de metalloprotei- 
nase (TIMP), au site d'initiation de traduction. 

25 10. Plasmide comprenant le promoteur precoce-immediat du cytomegalovirus humain et la region non 
traduite en 5', fonctionnellement complete, du g^ne precoce-immediat majeur, y compris le premier 
intron, dans lequel la region non traduite en 5' comprend un site d'initiation de traduction qui coincide 
avec un site de restriction NC01. 



30 
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T 

DNS P t AM 
set 8 h fl 
aoy t 3 lu 
111 12 31 
// / 
CCAT66T6TCAA66AC6GT6ACT6CA6T6AATAATAAAAT6T6T6TTT6TCCGAAATAC6 
1 ^ + + + + + 60 

GGTACCACAGTTCCTGCCACTGACGTCACTTATTATTTTACACACAAACAGGCTTTATGC 

CGTTTTGAGATTTCTGTCGCCGACTAAATTCATGTCGCGCGATAGTGGTGTTTATCGCCG 

61 + + + + + + 120 

GCAAAACTCTAAAGACAGCGGCTGATTTAAGTACAGCGCGCTATCACCACAAATAGC6GC 

C 
1 
a 
1 

ATAGAGATGGCGATATTGGAAAAATCGATATTTGAAAATATGGCATATTGAAAATGTCGC 

121 + + + ♦ + + 180 

TATCTCTACCGCTATAACCTTTTTAGCTATAAACTTTTATACCGTATAACTTTTACAGCO 



E 
c 
o 
R 
V 

CGATGTGAGTTTCTGTGTAACTGATATCGCCATTTTTCCAAAAGTGATTTTTGGGCATAC 
181 + + ^. + + + 240 

GCTACACTCAAAGACACATTGACTATAGCGGTAAAAAGGTTTTCACTAAAAACCCGTATG 



c 
o 
R 
V 

GCGATATCTGGCGATAGCGGCTTATATCGTTTACGGGGGATGGC6ATAGACGACTTTGGT 

241 ♦ + + + + + 300 

CGCTATAGACCGCTATCGCCGAATATAGCAAATGCCCCCTACCGCTATCTGCTGAAACCA 

GACTTGGGCGATTCTGTGTGTCGCAAATATCGCAGTTTCGATATAGGTGACAGACGATAT 

301 + + + + + + 360 

CTGAACCCGCTAAGACACACAGCGTTTATAGCGTCAAAGCTATATCCACTGTCTGCTATA 

C BH N C 

£ aa 8 1 

r le i a 

1 11 11 
/ 

GAGGCTATATCGCCGATAGAGGCGACATCAAGCTGGCACATGGCCAATGCATATCGATCT 

361 + + + + + + 420 

CTCCGATATAGCGGCTATCTCCGCTGTAGTTCGACCGTGTACCGGTTACGTATAGCTAGA 



Fig. 4A 
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S C BH 
s f aa 
p r le 
1 1 11 

/ 

ATACATTGAATCAATATTGGCCATTAGCCATATTATTCATTGGTTATATAGCATAAATCA 

421 + + — -+ + + + 480 

TATGTAACTTAGTTATAACCGGTAATCGGTATAATAAGTAACCAATATATCGTATTTAGT 
S C BH 

s f aa 

p r le 

1 1 11 

/ 

ATATTGGCTATTGGCCATTGCATACGTTGTATCCATATCATAATATGTACATTTATATTG 

481 + + + + + + 540 

TATAACCGATAACCGGTAACGTATGCAACATAG6TATA6TATTATACATGTAAATATAAC 

H 

1 M S 
n m p 
c e e 

2 1 1 
GCTCATGTCCAACATTACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAAT 

541 + + -+ + + 600 

CGAGTACAGGTTGTAATGGCGGTACAACTGTAACTAATAACTGATCAATAATTATCATTA 

CAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGG 

601 + + + + + + 660 

GTTAATGCCCCAGTAATCAAGTATCGGGTATATACCTCAAGGCGCAATGTATTGAATGCC 

B A A 

9 ha 
1 at 
1 2 2 

TAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGT 
661 + + + + + + 720 

ATTTACCGGGCGGACCGACTGGCGGGTTGCTGGGGGCGGGTAACTGCAGTTATTACTGCA 

A A 
h a 
a t 
2 2 

ATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTAC 

721 + + + + + + 780 

TACAAGGGTATCATTGCGGTTATCCCTGAAAG6TAACTGCAGTTACCCACCTCATAAATG 

B N 
g d 
1 e 
1 1 
GGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTG 
781 + + + + + + 840 

CCATTTGACGGGTGAACCGTCATGTA6TTCACATAGTATAC6GTTCATGCGGGGGATAAC 



Fig.AB 
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A A B 
ha 9 
at 1 
2 2 1 

ACGTCAATGACGGTAAATGGCCC6CCT6GCATTATGCCCAGTACATGACCTTAT660ACT 
841 + + + + + + 900 

TGCAGTTACTGCCATTTACCGGGCGGACCGTAATACGGGTCATGTACTGGAATACCCTGA 



n DNS 
a set 
B aoy 
1 111 

// 

TTCCTACTTGGCAGTACATCTAC6TATTAGTCATC6CTATTACCATGGTGATGCGGTTTT 

901 + + + + + + 960 

AAGGATGAACCGTCATGTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAA 

GGCAGTACATCAATGG6CGTGGATAGCGGTTTGACTCAC6QGGATTTCCAAGTCTCCACC 

9ei + + + + + + 1020 

CCGTCAT6TAGTTACCCGCACCTATCGCCAAACTGAGTGCCCCTAAAGGTTCAGA6GTGG 

A A B 

ha a 

at n 

2 2 1 
CCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTC 

1021 + + + + + + 1080 

GGTAACTGCAGTTACCCTCAAACAAAACCGTGGTTTTAGTTGCCCTGAAAGGTTTTACAG 

GTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATA 

1081 + + + + + + 1140 

CATTGTTGAGGCGGGGTAACTGCGTTTACCCGCCATCCGCACATGCCACCCTCCAGATAT 

6H 

BesS G A 

apia s h 

nlAc u a 

2211 1 2 

/// 

TAAGCAGAGCTC6TTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTG 

1141 + + + + -+ + 1200 

ATTCGTCTCGAGCAAATCACTTGGCAGTCTAGCGGACCTCTGCGGTAGGTGCGACAAAAC 

N 

B D BCGsSX 

b s gfdpam 

V a IriBca 

2 1 112223 

//// 

ACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGGCCGGGAACGGTGCATTGGAA 

1201 + + + + ♦ + 1260 

TGGAGGTATCTTCTGTG6CCCTGGCTAGGTCGGAGGCGCCGGCCCTTGCCACGTAACCTT 



Fig. AC 
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CGCGGATTCCCCGTGCCAA6AGTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACCC 

1261 + + + + ♦ + 1320 

GCGCCTAAGG6GCACGGTTCTCACTGCATTCATGGCGGATATCTCAGATATCCGGGTGGG 

B N 

Ss N SS 

tt S pp 

yX i Hh 

11 1 11 

/ 

CCTT6GCTTCTTATGCATGCTATACTGTTTTTGGCTTGGGGTCTATACACCCCCGCTTCC 

1321 + * + + + 1380 

G6AACCGAAGAATACGTACGATATGACAAAAACCGAACCCCAGATATGTGGGGGCGAAGG 

E 
8 

P 
1 

TCATGTTATAGGTGATGGTATAGCTTAGCCTATAGGTGTGGGTTATTGACCATTATT6AC 

1381 + + + + + + 1440 

AGTACAATATCCACTACCATATCGAATCGGATATCCACACCCAATAACTGGTAATAACTG 

P 
f 
1 
M 
1 

CACTCCCCTATTGGTGACGATACTTTCCATTACTAATCCATAACATGGCTCTTTGCCACA 

1441 ^ + • — + + + + 1500 

GT6AG6GGATAACCACTGCTATGAAAGGTAATGATTAGGTATTGTACCGAGAAACGGTGT 

E 

c 
o 

5 
7 

ACTCTCTTTATTGGCTATATGCCAATACACTGTCCTTCAGAGACTGACACGGACTCTGTA 

1501 + + + + + ♦ 1560 

TGAGAGAAATAACCGATATACGGTTATGTGACAGGAAGTCTCTGACTGTGCCTGAGACAT 

E 
c 
o 
3 
1 

TTTTTACAGGAT6GGGTCTCATTTATTATTTACAAATTCACATATACAACACCACCGTCC 

1561 + + + + + 1620 

AAAAATGTCCTACCCCAGAGTAAATAATAAATGTTTAA6TGTATAT6TTGTGGTGGCAGG 

B 

8 X A A 

p h V f 

1 o a 1 

2 2 13 
CCAGTGCCC6CAGTTTTTATTAAACATAACGTGGGATCTCCACGCGAATCTCGGGTACGT 

1621 + + + + + + 1680 

GGTCACGGGCGTCAAAAATAATTTGTATTGCACCCTAGAGGTGCGCTTAGAGCCCATGCA 



Fig. AD 
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B B B 

5 Bs B8 

P ap ap 

M nl nl 

2 22 22 

/ / 
GTTCCGGACATGGGCTCTTCTCCGGTAGCGGCGGAGCTTCTACATCCGAGCCCTGCTCCC 

1681— -+ + + ♦ +- + 1740 

CAAGGCCTGTACCCGAGAAGAGGCCATCGCCGCCTC6AAGATGTAGGCTCGGGACGAGGG 

G H 
8 a 
u e 
1 1 

ATGCCTCCAGCGACTCATG6TCGCTCG6CAGCTCCTT0CTCCTAACAGT66AG6CCAGAC 
1741: + + + + + + 1800 

TACGGAGGTCGCTGAGTACCAGC6AGCCGTC6AG6AACGA6GATTGTCACCTCCGGTCTG 

D 
8 

a 
1 

TTAGGCACAGCACGATGCCCACCACCACCAGTGTGCCGCACAAGGCCGTGGCGGTAGGGT 

1801 + + + + ♦ + 1860 

AATCCGTGTCGTGCTACGGGTGGTGGTGGTCACACGGCGTGTTCCGGCACCGCCATCCCA 

BH N 
ABsgS s A B 

vapia p f b 

anlAc B 1 v 

12211 2 2 2 

/// 

ATGTGTCTGAAAATGAGCTCGGGGAGCGGGCTTGCACCGCTGACGCATTTGGAAGACTTA 

1861 + + -+ + + + 1920 

TACACAGACTTTTACTCGAGCCCCTCGCCCGAAC6TGGCGACTGCGTAAACCTTCTGAAT 

N N 
s sP 
P PV 
B Bu 
2 22 

/ 

AGGCAGCGGCA6AAGAAGATGCAGGCAGCTGAGTTGTTGTGTTCTGATAAGAGTCAGAGG 

1921 + -¥ 4- + + + 1980 

TCCGTCGCCGTCTTCTTCTACGTCCGTCGACTCAACAACACAAGACTATTCTCAGTCTCC 

H 

IH S 
np c 
ca a 

21 1 

/ 

TAACTCCCGTTGCGGTGCTGTTAACGGTGGAGGGCAGTGTAGTCTGAGCAGTACTCGTTS 

1981 + + + + + + 2040 

ATTGAGGGCAACGCCACGACAATTGCCACCTCCCGTCACATCAGACTCGTCATGAGCAAC 



Fig.AE 
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B B 

s s DNS 

S 8 set 

K H aoy 
2 2 111 

// 

CTGCCGCGCGCQCCACCAGACATAATAGCTGACAGACTAACAGACTGTTCCTTTCCATGG 

2041 + + ♦ + + + 2100 

GACGGCGCGCGCGGTGGTCTGTATTA7CGACTGTCTGATTGTCTGACAAGGAAAGGTACC 

P DNS 
8 set 
t aoy 
1 111 

// Ncol 

GTCTTTTCTGCAGTCACCGTCCTT6ACACCATG | 
2101- 

CAGAAAAGACGTCAGTGGCAGGAACTGTG 



// nc 
ACCATGJ 

TG I 
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